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METHODS OF MODIFYING THE PRODUCTION OF ISOPENTENYL 
PYROPHOSPHATE, DIMETHYLALLYL PYROPHOSPHATE AND/OR 

ISOPRENOIDS 

BACKG RO UND OF T HE INVENTION 
5 Field of the Invention 

The present invention is directed to genes encoding deoxyxylulose-5- 
phosphate synthase (dxps), glyceraIdehyde-3-phosphate dehydrogenase (gapd), 
and the lytB gene product, as well as vectors containing the same and hosts 
transformed with said vectors. The present invention also provides methods for 
10 modifying the production of isopentenyl pyrophosphate (IPP) and/or dimethylallyl 
pyrophosphate (DMAPP) and/or an isoprenoid compound (i.e., a compound derived 
from IPP and/or DMAPP). Additionally, the present invention provides a method 
for screening for procaryotic and eukaryotic genes encoding enzymes that 
participate in the nonmevalonate pathway leading to IPP and DMAPP. 



15 Background o f the I nvention 

A plethora of chemical compounds are produced by what are collectively 
known as the pathways of isoprenoid (a.k.a. terpene) biosynthesis. The one feature 
in common to the many isoprenoids (more than 20,000 have been identified in 
plants; Chappell, 1995) is their biosynthesis from the central metabolites and 

20 building blocks for all isoprenoid compounds: the 5 carbon compounds isopentenyl 
pyrophosphate (IPP) and its allylic isomer dimethylallyl pyrophosphate (DMAPP). 
The interconversion of IPP and DMAPP is reversible, and is catalyzed by the 
enzyme IPP isomerase (IPI). Using the IPP and DMAPP building blocks in various 
combinations, a modular assembly process that produces compounds of 5, 10, 15, 

25 20 or more carbons (in multiples of 5) allows the biosynthesis of the basic skeletons 
for the many and various isoprenoids with a relatively small number of basic 
reaction steps. The C^ skeleton of plant carotenoid pigments, for instance, is 
assembled from two molecules of a C 20 compound, geranylgeranyl pyrophosphate 
(GGPP), that is itself assembled from 3 units of IPP and 1 unit of DMAPP, and that 

30 also serves as a precursor for many other branches of the isoprenoid pathway in 
plants. 

Isoprenoids are formed from IPP and DMAPP in at least three different 

1 
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compartments of plants cells: the cytosol/endoplasmic reticulum, the mitochondria 
(and/or Golgi apparatus), and the plastids (McGarvey and Croteau, 1995). The 
source of IPP and DMAPP for isoprenoid biosynthesis in these different 
compartments has long been a matter of some controversy and debate (see Bach, 
1995; McGarvey and Croteau, 1995). The well-known "classical" or 
acetate/mevalonate route to IPP and DMAPP proceeds from acetyl-CoA via 3- 
hydroxy-3-methylglutaryl-CoA (HMG-CoA) and mevalonic acid (MVA). The critical, 
rate-determining and irreversible step in this route in animals is the reduction of 
HMG-CoA to produce MVA, catalyzed by the enzyme HMG-CoA reductase 
(HMGR; EC 1.1.1.34; reviewed in Goldstein and Brown, 1990). This enzyme also 
appears to provide an important control point for substrate flow into certain 
isoprenoids in plants (Bach, 1986; Chappell et al., 1995; Schaller et al. f 1995; 
Stermer et al., 1994). HMGR has been localized in the cytosol of plants, 
associated with membranes of the endoplasmic reticulum (reviewed in Bach, 1995). 
There is no convincing experimental evidence that this key enzyme also resides in 
the plastids or mitochondria of plants. An uptake by plastids of IPP to support 
isoprenoid biosynthesis in this organelle has been suggested in the past (Kreuz and 
Kleinig, 1984). However, it has been difficult to reconcile a cytosolic source of IPP 
for plastid isoprenoids with the effective inhibition of cytoplasmically-made 
isoprenoids (e.g. sterols) but not of plastid-made isoprenoids (e.g. carotenoids) or 
mitochondrial ones (quinones) .by mevinolin, a specific inhibitor of HMG-CoA 
reductase (discussed in Bach, 1995 and McGarvey and Croteau, 1995). 
Furthermore, the demonstrated ability of isolated chloroplasts of Acetabularia 
(Moore and Sheperd, 1977) and spinach (Schulze-Seibert and Schultz, 1987) to 
synthesize carotenoids and other isoprenoids from 14 C0 2 necessarily implies a 
chloroplast pathway for IPP biosynthesis. Further confusion arose from reports that 
radiolabeled acetate is efficiently incorporated by isolated chloroplasts into fatty 
acids but not into carotenoids (see Kleinig, 1989; Liedvogel, 1986; Schulze-Seibert 
and Schultz, 1987), a puzzling observation that has been explained by invoking a 
metabolic "channeling" that somehow renders the newly-formed acetyl-CoA pool 
available for fatty acid biosynthesis but unavailable for IPP biosynthesis (Kleinig, 
1989), 
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An attractive explanation for these many puzzling and apparently 
contradictory observations is provided by the suggestion of an "alternative" or non- 
mevalonate pathway for the biosynthesis of IPP and/or DMAPP in the chloroplasts 
of plants (Lichtenthaler et al., 1997) and in algae (Schwender et al. t 1996). 
5 Existence of a distinct non-mevalonate biochemical pathway to IPP and/or DMAPP, 
using pyruvate rather than acetate as a substrate, was posited for certain bacteria, 
including Escherichia coli, as early as 1981 (Pandian et ai^ 1981), but this 
alternative pathway in bacteria received little attention until relatively recently (Zhou 
and White, 1991; Rohmer et ah, 1993 and Horbach et al., 1993; see Bach, 1995 

10 for a critical discussion of this topic). It appears that many bacteria do not have the 
acetate/mevalonate pathway for biosynthesis of IPP. For instance, a BLAST 
search of the £. coli genome, the entire sequence for which has now been 
deposited in GenBank, does not identify any open reading frames with significant 
similarity to known bacterial, plant or mammalian HMG-CoA reductases. Nor can 

1 5 we discern an HMG-CoA reductase homologue in the genome sequence data base 
of the cyanobacterium Synechococystis 6803. Because ancestors of the 
cyanobacteria are the presumptive progenitors of plant chloroplasts, the non- 
mevalonate route to IPP in these photosynthetic procaryotes is likely to resemble 
that in plant chloroplasts. 

20 Recent reports indicate that plant and algal chloroplast isoprenoids, including 

carotenoids (Lichtenthaler et al., 1997; Schwender et al., 1996), and the medically- 
important isoprenoid paclitaxel (a.k.a. TAXOL®) (Eisenreich et al., 1996) are made 
from IPP supplied via a non-mevalonate pathway. Whether this pathway supplies 
IPP for carotenoids and other isoprenoids manufactured in the various 

25 nonphotosynthetic plant plastids (e.g. chromoplasts, leucoplasts, amyloploasts) or 
in etioplasts or developing chloroplasts is an open question. Several recent reports 
describe a thiamin-dependent enzyme from E. coli (Sprenger et a/., 1997; Lois ef 
a/., 1998) and from peppermint (Lange et al., 1998) that converts pyruvate and 
glyceraldehyde-3-phosphate (GAP) to deoxyxylulose-5-phosphate (DXP). A 

30 homologue of this enzyme was earlier described for Arabidopsis thaliana (Mandel 
et a/., 1996), but the function of the A. thaliana polypeptide had not been 
ascertained. 
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The evidence is compelling that DXP is a substrate in the pathway leading 
to IPP or DMAPP in plants, algae, and certain bacteria including E. coli (Arigoni et 
al., 1997). However, DXP also is a substrate leading to other essential compounds 
in these organisms, including thiamin and pyridoxal (see Lois et ah, 1998). 
5 Therefore the biochemical reaction catalyzed by the DXP synthase enzyme is not 
dedicated or restricted to providing substrate for the pathway or pathways leading 
to IPP and/or DMAPP, and isoprenoids derived therefrom. Because the product 
of the reaction is shared with several other pathways, the DXP synthase enzyme 
would not seem to be an obvious candidate for a controlling step in the pathway 

10 leading to isoprenoids. An ability to modify isoprenoid production, enhancing 
production of desired compounds or reducing that of undesirable compounds, 
would be quite advantageous in many applications. 

Plausible routes to IPP from pyruvate and GAP via DXP have been 
suggested for plants and bacteria (Schwender et al., 1 996; Rohmer et al, 1 993 and 

15 1996; see Bach, 1995). A gene encoding what may be the second enzyme in the 
pathway, DXP reductoisomerase, has recently been identified in Escherichia coli 
(Takahashi et al., 1998). The precise nature and sequence of subsequent 
biochemical reactions in the nonmevalonate pathway, and whether regulation of 
flux through this pathway occurs primarily at a single controlling step, as with HMG- 

20 CoA reductase for the mevalonate pathway, are so far unknown. We have taken 
a genetic approach to delineate the pathway leading to IPP and DMAPP in E. coli 
and plants. 

We earlier made the surprising observation that the reversible 
interconversion of IPP and DMAPP limits the accumulation of carotenoids in E. coli 

25 (an observation alluded to in Sun et al., 1996 and Sun et al M 1998). Others 
(Kajiwara et al., 1997) have made a similar observation. As a direct result of 
continuing to pursue the same empirical approach (using a colored isoprenoid 
compound to "report" on the amount of IPP and DMAPP available for isoprenoid 
biosynthesis) which revealed that the isomerization of IPP is a limiting step in 

30 isoprenoid biosynthesis (see U.S. Patent No. 5,744,341), we have now identified 
several other genes which can be used to modify IPP, DMAPP and/or isoprenoid 
production in host cells. 
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For two of the these genes the functions of the products, glyceraldehyde-3- 
phosphate dehydrogenase and DXP synthase, are well known. The role of the 
product of the third, the lytB gene, is still obscure. Because certain mutations in 
this gene in Escherichia coli confer an ability to tolerate elevated levels of penicillin 
(Harkness et al., 1992; Gustafson et al., 1993) the lytB gene product would seem 
to play some role in or otherwise influence the biosynthesis of peptidoglycan. It has 
been suggested that the lytB gene product is not an enzyme directly involved in the 
pathway leading to peptidoglycan but rather exerts some control on the production 
of the global regulator molecule guanosine 3\5'-bisphosphate (ppGpp) (Gustafson 
et al., 1993; Rodionov and Ishiguro, 1995). The evidence in support of this 
suggestion is correlative and indirect and therefore by no means convincing. 
Because lytB influences the accumulation of certain isoprenoids in E. coli (see 
below), because an analysis of the nearly twenty completed, bacterial genomes 
reveals it to be one of the very few genes found only in those bacteria that also 
contain genes encoding DXP synthase and DXP reductoisomerase, and because 
we have isolated an apparently chloroplast-targeted lytB cDNA from a eucaryotic 
plant, we speculate that lytB may encode an enzyme that catalyzes one of the later 
reactions in the nonmevalonate pathway leading to IPP and DMAPP in certain 
bacteria and in plant chloroplasts. To avoid confusion, it is important to note that 
another gene, which is completely unrelated to the gene under consideration here, 
has unfortunately also been given the name lytB (see, e.g., Garcia et al M 1999), 

There remains a need for methods to enhance the production of desirable 
isoprenoids and reduce the accumulation of undesirable isoprenoids in plants, fungi 
and bgcteria that contain a nonmevalonate pathway. There also remains a need 
in the art for methods for screening for procaryotic and eukaryotic genes encoding 
enzymes of isoprenoid biosynthesis and metabolism. 

SUMMA R Y OF TH E INVENTION 
A subject of the present invention is an isolated nucleic acid sequence which 
encodes for a protein having DXP synthase enzyme activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 1 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
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85% identical to SEQ ID NO: 2. 

Another subject of the present invention is an isolated nucleic acid sequence 
which encodes for a protein having GAP dehydrogenase enzyme activity, wherein 
the nucleic acid sequence is at least 85% identical to SEQ ID NO: 3 or the nucleic 
5 acid sequence encodes a protein which has an amino acid sequence which is at 
least 85% identical to SEQ ID NO: 4. 

A further subject of the present invention is an isolated nucleic acid 
sequence which encodes for a protein having LYTB activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 5 or the nucleic acid 
10 sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 6. 

The invention also includes vectors which comprise any of the nucleic acid 
sequences listed above, and host cells transformed with such vectors. 

Another subject of the present invention is a method of enhancing the 
15 production of IPP, DMAPP and/or an isoprenoid compound in a host cell, 
comprising inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein having DXP synthase, GAP 
dehydrogenase and/or LYTB activity, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and expressing the heterologous nucleic 
20 acid sequence, thereby producing the protein. 

Yet another subject of the present invention is a method of modifying the 
production of IPP, DMAPP and/or an isoprenoid compound in a host cell, 
comprising inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein which modifies DXP synthase, GAP 
25 dehydrogenase and/or LYTB activity in the host cell, wherein the heterologous 
nucleic acid sequence is operably linked to a promoter; and expressing the 
heterologous nucleic acid sequence, thereby producing the protein. 

The present invention also includes a method of expressing, in a host cell, 
a heterologous nucleic acid sequence which encodes for a protein having DXP 
30 synthase, GAP dehydrogenase and/or LYTB activity, the method comprising 
inserting into the host cell a vector comprising the heterologous nucleic acid 
sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
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promoter; and expressing the heterologous nucleic acid sequence. 

Also included is a method of expressing, in a host cell, a heterologous 
nucleic acid sequence which encodes for a protein which modifies DXP synthase, 
GAP dehydrogenase and/or LYTB activity in the host cell, the method comprising 
5 inserting into the host cell a vector comprising the heterologous nucleic acid 
sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter; and expressing the heterologous nucleic acid sequence. 

Another subject of the present invention is to provide a method for screening 
for eukaryotic genes which encode enzymes involved in isoprenoid biosynthesis 
10 and metabolism. 

BRIEF DESCRIP TI ON OF TH E DRAWINGS 
A more complete appreciation of the invention and many of the attendant 
advantages thereof will be readily obtained as the same becomes better 
understood by reference to the following detailed description when considered in 
15 connection with the accompanying drawings, wherein: 

Figure 1 is an illustration showing that IPP and DMAPP serve as the central 
metabolites leading to an immense variety of different isoprenoid compounds in 
plants, bacteria, and other organisms. The mevalonic acid pathway via acetyl-CoA 
in the cytosol of plants and in animals is well characterized. A second route from 

20 pyruvate and GAP, not yet elucidated, is thought to operate in the chloroplasts of 
plants and algae, in cyanobacteria, and in many bacteria. Later reaction steps in 
the pathway remain to be determined and are therefore denoted with question 
marks. The pathway is shown leading to a box containing IPP and DMAPP 
because it has not been established which of these two is the initial product of this 

25 pathway. Abbreviations: G3P, glyceraldehyde-3-phosphate; G3PD, GAP 
dehydrogenase; DXPS, DXP synthase; IPP, isopentenyl pyrophosphate; IPI, IPP 
isomerase; DMAPP, dimethylallyl pyrophosphate; FPP, farnesyi pyrophosphate; 
GPP geranyl pyrophosphate; GGPP, geranylgeranyl pyrophosphate. 

Figure 2 illustrates a route to IPP/DMAPP from pyruvate and glyceraldehyde-3- 
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phosphate (GAP), which is thought to operate in the chloroplasts of plants and 
algae, in cyanobacteria, and in many bacteria including E. coli Later reaction steps 
in the pathway remain to be determined and are therefore denoted with question 
marks. The pathway is shown leading to a box containing IPP and DMAPP 

5 because it has not been established which of these two is the initial product of this 
pathway. Enzymes of interest are shown in bold white text in a black box. 
Abbreviations: DMAPP, dimethylallyl pyrophosphate; DXPS, deoxyxylulose-5- 
phosphate synthase; DXPR, deoxyxylulose-5-phosphate reductoisomerase; FPS, 
farnesyl pyrophosphate synthase, GAPD, glyceraldehyde-3-phosphate 

10 dehydrogenase; GGPP, geranylgeranyl pyrophosphate; IPI, isopentenyl 
pyrophosphate isomerase; IPP, isopentenyl pyrophosphate. 

Figure 3 illustrates that the C 40 carotenoid phytoene is derived by a head-to-head 
condensation of two molecules of the C 20 GGPP compound, which itself is 
assembled from 3 molecules of IPP and 1 molecule of DMAPP. Abbreviations: 
15 FPP, farnesyl pyrophosphate; GPP geranyl pyrophosphate; PPPP, prephytoene 
pyrophosphate. 

Figure 4 is a schematic illustration and restriction map of plasmid pAC-LYC 
(Cunningham et al., 1994) which contains genes (crtE, crtB, and crtl) of Erwinia 
herbicola encoding all of the enzymes required for production of the pink-colored 
20 isoprenoid pigment lycopene from the colorless IPP and DMAPP compounds. Cm, 
chloramphenicol resistance gene. 

Figure 5A is a cDNA sequence and Figure 5B is the predicted amino acid sequence 
of a putative DXP synthase isolated from a flower cDNA library of Tagetes erecta 
(SEQ ID NOS: 1 and 2). The cDNA is incorporated into the plasmid pMarDXPS. 

25 Figure 6A is a cDNA sequence and Figure 6B is the predicted amino acid sequence 
of a chloroplast isoform of GAP dehydrogenase isolated from Arabidopsis thaliana 
(SEQ ID NOS: 3 and 4). The cDNA is incorporated into the plasmid pAtG3PD. 
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Figure 7 A is a cDNA sequence and Figure 7B is the predicted amino acid sequence 
of a LYTB protein isolated from an Adonis palaestina flower cDNA library (SEQ ID 
NOS; 5 and 6). The sequence is of clone Ipi3 except that the 14 bp at the N- 
terminus were obtained from the slightly longer cDNA clone Ipi18. The cDNA is 
5 incorporated into the plasmid pApLYTB. 

Figure 8 is an alignment of the predicted amino acid sequences of LYTB from 
Adonis palaestina, Synechocystis PCC6803 and £. co//. The N-terminal extension 
of the Adonis polypeptide, relative to that of Synechocystis PCC6803, is predicted 
by the program ChloroP (Emanuelsson et al. t 1999) to constitute a chloroplast 
10 transit peptide, serving to target the polypeptide to this organelle in plants. Black 
boxes with white letters are used where all three of the aligned residues are 
identical. Grey boxes with black letters are used where two of the three aligned 
residues are identical. Abbreviations: Ap, Adonis palaestina: Sy, Synechocystis 
PCC6803; Ec t Escherichia coli. 

15 Figure 9 is a schematic representation of the mapping of an E. coli genomic 
fragment to ascertain which of the genes in this fragment will enhance or impair 
lycopene accumulation in E. co//. Deletion mapping of an E. coli genomic fragment 
(the insert in plasmid pEc3.9 is essentially identical to GenBank U32768: 
4879..8819) with genes encoding DXP synthase (orf620), FPP synthase {ispA) and 

20 the small subunit of exonuclease VII (xseB). Numbers to the right indicate relative 
lycopene accumulation per mL of liquid culture with plasmids in lycopene- 
accumulating E. coli strain TOP10. The insert in pEc3.9 is oriented in the forward 
direction in the multicopy plasmid vector pBluescript SK-. The EcoR\ and Sma\ 
sites in the vector preceding the genomic fragment and a Kpn\ site following it were 

25 used, along with A/del, Smal, and Sa/I sites in the genomic DNA, to construct the 
deletion subclones illustrated. Incomplete genes and open reading frames are not 
shown. 

DESCRIPTION OF THF PREFERRED EMBODIMENTS 
The term "isoprenoid" is intended to mean any member of the class of 
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naturally occurring compounds whose carbon skeletons are composed, in part or 
entirely of isopentyl C 5 units. Preferably, the carbon skeleton is of an essential oil, 
a fragrance, a rubber, a carotenoid, or a therapeutic compound, such as paclitaxel. 
A search of the public databases (GenBank) for genes encoding 
5 homologues of the E. coli lytB gene product unearthed such genes in twelve 
bacteria and in the cyanobacterium Synechocystis PCC6803. A recent publication 
(Potter et al., 1998) discusses the distribution of this gene in various bacteria. Our 
description herein of a homologue in the plant Adonis palaestina reveals for the first 
time that this gene is present and expressed in a eucaryotic organism. A recently 

10 deposited genomic DNA sequence for the green plant Arabidopsis thaliana 
(GenBank accession number AL035521 ) contains what appears to be a gene (the 
probable coding sequence is interrupted by several apparent introns) encoding 
LYTB in this organism. The predicted sequence of this Arabidopsis LYTB is 
somewhat uncertain (a comparison of the Adonis sequence with that listed given 

15 in AL035521 for Arabidopsis suggests that several of the exon-intron junctions 
predicted for this gene in the GenBank record are incorrect), but sequence identity 
in a comparison with the Adonis sequence is ca. 80% or more. Partial cDNA 
sequences in the data base of expressed sequence tags (dbEST) that predict 
peptides with sequence similarity to portions of the Adonis LYTB sequence indicate 

20 that homologues exist and mRNAs encoding LYTB are produced in several other 
plant species including rice (D45948), loblolly pine (AA556723) and soybean 
(AI437981 ). Both the Adonis and Arabidopsis predicted amino acid sequences are 
more than 60% identical to that predicted by the cyanobacterium Synechocystis 
PCC6803 gene, and the two plant and the cyanobacterial sequences are more than 

25 30% identical to the predicted £. coli gene product. An alignment of the Adonis, 
Synechocystis PCC6803 and E. coli predicted amino acid sequences is shown in 
Figure 7. A number of regions and residues conserved in LYTB are indicated in 
this Figure. 

The term "LYTB activity" is intended to mean the ability of LYTB to affect the 
30 production of IPP, DMAPP and/or isoprenoids in a host cell containing the lytB 
gene or DNA copy of the lytB mRNA. It has not yet been confirmed that the LYTB 
protein is, in fact, an enzyme. The precise role of the LYTB protein in affecting the 

10 
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production of isoprenoids has not been established. However, whatever the 
mechanism of action of the lytB gene product, we intend to cover that mechanism 
of action by the term "LYTB activity". 

The present invention is directed to an isolated nucleic acid sequence which 
5 encodes for a protein having DXP synthase enzyme activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 1 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence, which is at least 
85% identical to SEQ ID NO: 2. Preferably, the nucleic acid sequence is at least 
90%, at least 95% or completely identical to SEQ ID NO: 1, or the nucleic acid 

10 sequence encodes a protein which has an amino acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 2. 

Another subject of the present invention is an isolated nucleic acid sequence 
which encodes for a protein having GAP dehydrogenase enzyme activity, wherein 
the nucleic acid sequence is at least 85% identical to SEQ ID NO: 3 or the nucleic 

15 acid sequence encodes a protein which has an amino acid sequence which is at 
least 85% identical to SEQ ID NO: 4. Preferably, the nucleic acid sequence is at 
least 90%, at least 95% or completely identical to SEQ ID NO: 3, or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 4. 

20 A further subject of the present invention is an isolated nucleic acid 

sequence which encodes for a protein having LYTB activity, wherein the nucleic 
acid sequence is at least 85% identical to SEQ ID NO: 5 or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
85% identical to SEQ ID NO: 6. Preferably, the nucleic acid sequence is at least 

25 90%, at least 95% or completely identical to SEQ ID NO: 5, or the nucleic acid 
sequence encodes a protein which has an amino acid sequence which is at least 
90%, at least 95% or completely identical to SEQ ID NO: 6. 

In each case, sequence similarity is measured using sequence analysis 
software, for example, the Sequence Analysis software package of the Genetics 

30 Computer Group (University of Wisconsin Biotechnology Center, 1 710 University 
Avenue, Madison, Wisconsin 53705), MEGAIign (DNAStar, Inc., 1228 S. Park St., 
Madison, Wisconsin 53715), or MacVector (Oxford Molecular Group, 2105 S. 
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Bascom Avenue, Suite 200, Campbell, California 95008). Such software matches 
similar sequences by assigning degrees of identity to various substitutions, 
deletions, and other modifications. Conservative (i.e. similar) substitutions typically 
include substitutions within the following groups: glycine and alanine; valine, 
5 isoleucine and leucine; aspartic acid, glutamic acid, asparagine and glutamine; 
serine and threonine; lysine and arginine; and phenylalanine and tyrosine. 
Substitutions may also be made on the basis of conserved hydrophobicity or 
hydrophilicity (see Kyte and Doolittle, J. Mol. Biol. 157: 105-132 (1982)), or on the 
basis of the ability to assume similar polypeptide secondary structure (see Chou 

10 and Fasman, Adv. Enzymoi 47: 45-148 (1978)). If comparison is made between 
nucleotide sequences, preferably the length of comparison sequences is at least 
50 nucleotides, more preferably at least 60 nucleotides, at least 75 nucleotides or 
at least 100 nucleotides. It is most preferred if comparison is made between the 
nucleic acid sequences encoding the protein coding regions necessary for protein 

15 activity. If comparison is made between amino acid sequences, preferably the 
length of comparison is at least 20 amino acids, more preferably at least 30 amino 
acids, at least 40 amino acids or at least 50 amino acids. It is most preferred if 
comparison is made between the amino acid sequences in the protein coding 
regions necessary for protein activity. 

20 The present inventors have isolated eukaryotic genes encoding DXP 

synthase from Tagetes erecta (marigold), GAP dehydrogenase from Arabidopsis 
thaliana, and LYTB from Adonis palaestina. All were identified on the basis of an 
enhancement of lycopene accumulation in E. coli. The E. coli DXP synthase was 
also identified in this same way, 

25 Suitable vectors according to the present invention comprise a gene 

encoding one or more of the above-identified enzymes involved in IPP, DMAPP 
and/or isoprenoid biosynthesis or metabolism, wherein the gene is operably linked 
to a suitable promoter. Suitable promoters for the vector can be constructed using 
techniques well known in the art (see, for example, Sambrook et al„ Molecular 

30 Cloning A La boratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY, 1989; Ausubel et al., Current Protocols in Molecular Biology, Greene 
Publishing and Wiley Interscience, New York, 1991). Suitable vectors for 
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eukaryotic expression in plants are described in Frey et al., Plant J, (1995) 8(5):693 
and Misawa et al, 1994a. Suitable vectors for prokaryotic expression include 
pACYC184, pUC1 19, and pBR322 (available from New England BioLabs, Bevery, 
MA) and pTrcHis (Invitrogen) and pET28 (Novagen) and derivatives thereof. The 
vectors of the present invention can additionally contain regulatory elements such 
as promoters, repressors, selectable markers such as antibiotic resistance genes, 
etc., the construction of which is very well known in the art. 

One or more of the genes encoding the enzymes as described above, when 
cloned alone or in combination into a suitable expression vector, can be used to 
overexpress these enzymes in a plant expression system or to inhibit the 
expression of these enzymes. For example, a vector containing one or more of the 
genes of the invention may be used to increase the amount of isoprenoids in an 
organism and thereby alter the nutritional or commercial value or pharmacology of 
the organism. 

Therefore, the present invention includes a method of enhancing the 
production of IPP, DMAPP and/or an isoprenoid in a host cell, relative to an 
untransformed host cell, the method comprising inserting into the host cell a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein 
having DXP synthase, GAP dehydrogenase and/or LYTB activity, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter; and 
expressing the heterologous nucleic acid sequence. 

The invention also includes a method of modifying the production of IPP, 
DMAPP and/or an isoprenoid in a host cell, the method comprising inserting into 
the host cell a vector comprising a heterologous nucleic acid sequence which 
encodes for a protein which modifies DXP synthase, GAP dehydrogenase and/or 
LYTB activity in the host cell, relative to an untransformed host cell, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter; and 
expressing the heterologous nucleic acid sequence. 

The invention further includes a method of expressing, in a host cell, a 
heterologous nucleic acid sequence which encodes for a protein having DXP 
synthase, GAP dehydrogenase and/or LYTB activity, the method comprising 
inserting into the host cell a vector comprising the heterologous nucleic acid 
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sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter, and expressing the heterologous nucleic acid sequence. 

The invention also includes a method of expressing, in a host cell, a 
heterologous nucleic acid sequence which encodes for a protein which modifies 
5 DXP synthase, GAP dehydrogenase and/or LYTB activity in the host cell, relative 
to an untransformed host cell, the method comprising inserting into the host cell a 
vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 
10 Preferably, the isoprenoid comprises a compound derived from at least one 

member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). By the 
term "derived from", we mean that the isoprenoid contains at least one of the listed 
compounds as a building block. It is also preferable if the isoprenoid comprises at 
15 least one member selected from the group consisting of a diterpene, a carotenoid, 
an essential oil, a fragrance, an isoprene, a cytokinin, a rubber, a quinone, a sterol, 
a hopanoid, a triterpene, a steroid, a prenylated protein, a phytoalexin, a gibberellin, 
a tocopherol, a dolichol, a chlorophyll and a therapeutic compound. 

It is most preferred if the isoprenoid comprises at least one member selected 
20 from the group consisting of an essential oil, a fragrance, a rubber, a carotenoid 
and a therapeutic compound, such as paclitaxel. 

The heterologous nucleic acid sequence may originate from a eukaryotic or 
procaryotic cell. By the term "originate from", we intend to mean that the sequence 
information for the heterologous nucleic acid came from the eukaryotic or the 
25 procaryotic cell. However, the specific nucleic acid itself does not have to be from 
the organism. The nucleic acid may come from the organism, or it may be 
synthetically produced using recombinant nucleic acid techniques known in the art. 

Preferably, the heterologous nucleic acid sequence comprises a nucleotide 
sequence for dxps, gapd and/or lytB. It is most preferred that the heterologous 
30 nucleic acid sequence comprises a nucleotide sequence which encodes a dxps, 
gapd and/or a lytB gene and is at least 85% identical, preferably at least 90%, at 
least 95% or completely identical, to SEQ ID NO: 1 , 3 and/or 5, respectively, or the 
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nucleic acid sequence encodes a protein which has an amino acid sequence which 
is at least 90%, at least 95% or completely identical to SEQ ID NO: 2, 4 and/or 6, 
respectively. Identity is determined as noted above. 

The term "modifying the production" in the methods of the invention means 
that the amount of target compounds produced (e.g., IPP, DMAPP and/or 
isoprenoids) can be enhanced or reduced, as compared to an untransformed host 
cell. Thus, in accordance with an embodiment of the present invention, the 
production or the biochemical activity of the target compounds (or the enzymes 
which catalyze their formation) may be reduced or inhibited by a number of different 
approaches available to those skilled in the art, including but not limited to such 
methodologies or approaches as anti-sense (e.g., Gray et aL, 1992), ribozymes 
(e.g., Wegener et al., 1994), co-suppression (e.g. Fray and Grierson, 1993), 
targeted disruption of the gene (e.g., Schaefer et al., 1997), intracellular antibodies 
(e.g., see Rondon and Marasco, 1997) or whatever other approaches rely on the 
knowledge or availability of the nucleic acid sequences of the invention, or the 
proteins encoded thereby. 

Host systems according to the present invention can comprise any organism 
that utilizes a nonmevalonate (i.e., via DXP) pathway for production of IPP and/or 
DMAPP. Organisms which produce isoprenoids using IPP and/or DMAPP derived 
from a nonmevalonate pathway include plants, algae, certain bacteria, 
cyanobacteria and other photosynthetic bacteria. Transformation of these hosts 
with vectors according to the present invention can be done using standard 
techniques. See, for example, Sambrook et al., Molecular Cloning A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1989; Ausubel 
et al., Current Protocols in Molecular Biology , Greene Publishing and Wiley 
Interscience, New York, 1991. 

Alternatively, transgenic organisms can be constructed which include the 
nucleic acid sequences of the present invention. The incorporation of these 
sequences can allow the controlling of isoprenoid biosynthesis, content, or 
composition in the host cell. These transgenic systems can be constructed to 
incorporate sequences which allow for the overexpression of the various nucleic 
acid sequences of the present invention. Transgenic systems can also be 
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constructed which allow for the underexpression of the various nucleic acid 
sequences of the present invention. Such systems may contain anti-sense 
expression of the nucleic acid sequences of the present invention. Such anti-sense 
expression would result in the accumulation of the substrates of the enzyme 
5 encoded by the sense strand. 

Having generally described this invention, a further understanding can be 
obtained by reference to certain specific examples which are provided herein for 
purposes of illustration only and are not intended to be limiting unless otherwise 
specified. 

10 EXAMPLE 1 

Is ol ation of plant cDNAs and bacterial genes that enhance carotenoid accumulation 
'^Escherichia coli 

The plasmid pAC-LYC (Cunningham et aL, 1994 and 1996) contains genes 
encoding all of the enzymes required for the formation of lycopene from IPP and 

1 5 DMAPP (see Figure 3). Cells of Eco// containing the plasmid pAC-LYC accumulate 
the carotenoid lycopene and thereby form colonies on solid growth medium that are 
pink in color (Cunningham et al M 1994). When cDNA or genomic libraries of 
plants, cyanobacteria, or bacteria are introduced into the lycopene-accumulating 
£ coli strain (the complete methodology is contained in Cunningham etai, 1994 

20 and 1996), a rare few of the colonies formed after spreading of the cell culture onto 
solid growth medium are much deeper in color than is typical of the vast majority 
of the colonies. The enhancement of color and concomitant carotenoid pigment 
accumulation in E. coli is prima facie evidence for an enhancement of flux into and 
through the pathway or pathways leading to the immediate precursors of 

25 carotenoids and all other isoprenoid compounds: IPP and its allyiic isomer DMAPP. 
Genes encoding subsequent enzymes in the pathway leading to lycopene are 
already present in the multicopy plasmid pAC-LYC (see Figure 3). 

Among the cDNAs and genes isolated with this screening methodology are: 
Tagetes erecta (marigold) cDNAs encoding a homologue of DXP synthase (the 

30 DNA sequence of the longest is given in Figure 4), an E.coli genomic clone 
containing the DXP synthase (and no other complete open reading: a genomic 
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fragment from the Sph\ site which lies at base 5210 in the sequence listed under 
GenBank U32768 to the Ndel site at base 8003 in this GenBank sequence was 
found to enhance carotenoid accumulation in E. coli. This region encompasses the 
open reading frame earlier referred to as f620, and now as the DXP synthase), 
5 Arabidopsis thaliana cDNAs encoding a chloroplast isoform (Shih et a/., 1991 and 
1992) of the enzyme GAP dehydrogenase (the sequence of the longest of these 
isolated cDNAs is given in Figure 5),and an Adonis palaestina cDNA encoding a 
homologue of the E. coli lytB gene product (Figure 6). The enhancement of 
isoprenoid biosynthesis in E. coli (as indicated by lycopene accumulation) when 

10 cDNAs or genes encoding DXP synthase or GAP dehydrogenase are introduced 
can be understood in the context of the biochemical pathway that has been 
postulated for production of IPP/DMAPP in E. coli (and in other bacteria, 
cyanobacteria and plants). However, at the date of the provisional filing of this 
application, there had been no indications that increased flux through the pathway 

15 to IPP/DMAPP might be obtained by increasing the expression of either of these 
enzymes. A publication after the priority date of this application (Harker and 
Bramley (1999)) shows a salutary effect on isoprenoid accumulation in E. coli in 
which foreign DXP synthase genes are introduced. 

The mechanism of enhancement of isoprenoid production by introduction of 

20 cDNAs encoding LYTB is not yet known. The nonmevalonate pathway leading to 
IPP and/or DMAPP has not yet been elucidated and these cDNAs may encode an 
enzyme subsequent to DXP synthase in the pathway. Alternatively, the 
enhancement of isoprenoid accumulation may involve a mechanism less direct 
(e.g., as for GAP dehydrogenase, involvement in biochemical reactions that utilize 

25 or supply the substrates GAP and pyruvate, or by exerting a regulatory influence 
on isoprenoid pathways). 

EXAMPLE 2 

Arabidopsis thaliana, Tagetes erecta , and Adonis palaestina cDNA Libraries 

A size-fractionated 2-3 kB cDNA library of A. thaliana in lambda ZAPII 
30 (Kieber et al., 1 993) was obtained from the Arabidopsis Biological Resource Center 
at The Ohio State University (stock number CD4-15). Other size fractionated 
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libraries were also obtained (stock numbers CD4-13, CD4-14, and CD4-16). The 
cDNA libraries for Tagetes erecta (marigold) and Adonis palaestina (pheasant's 
eye) were constructed for us by Stratagene using cDNAs isolated from flower 
tissues. An aliquot of each library was treated to cause a mass excision of the 
5 cDNAs and thereby produce a phagemid library according to the instructions 
provided by the supplier of the cloning vector (Stratagene; E. coli strain XL1-Blue 
and the helper phage R408 were used). The titre of the excised phagemid was 
determined and the library was introduced into a lycopene-accumulating strain of 
E co// TOP 10 F' (this strain contained the plasmid pAC-LYC) by incubation of the 

10 phagemid with the £ coli cells for 1 5 min at 37°C. Cells had been grown overnight 
at 30°C in LB medium supplemented with 2% (w/v) maltose and 10 mM MgS0 4 
(final concentration), and harvested in 1 .5 ml microfuge tubes at a setting of 3 on 
an Eppendorf microfuge (5415C) for 10 min. The pellets wereresuspended in 10 
mM MgS0 4 to a volume equal to one-half that of the initial culture volume. 

15 Transformants were spread on large (150 mm diameter) LB agar petri plates 
containing antibiotics to provide for selection of cDNA clones (ampicillin) and 
maintenance of pAC-LYC (chloramphenicol). Approximately 10,000 colony forming 
units were spread on each plate. Petri plates were incubated at room temperature 
for 2 to 7 days to allow maximum color development. Plates were screened visually 

20 with the aid of an illuminated 3x magnifier and a low power stage-dissecting 
microscope for the rare deep pink colonies that could be observed in the 
background of paler pink colonies. 

E nhancement of Carotenoid Accumulation in E. coli by cDNAs and g enes encoding 
25 LYTB, DXP synthase, IPP isomerase and GAP Dehydrogenase Individually and in 
Combination. 

Attempts were made to maximize production of three very different 
carotenoids in £ coli engineered to accumulate these isoprenoid pigments. The 
specific carotenoids chosen were the linear and lipophilic pink compound 
30 lycopene, the bicyclic hydrocarbon p-carotene and the much more polar dihydroxy, 
bicyclic carotenoid zeaxanthin. DXP synthase, IPP isomerase, LYTB, and GAP 
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dehydrogenase had been observed to enhance the pigmentation of E. coli colonies 
growing on solid agar medium. For a more quantitative appraisal of the influence 
of these genes and cDNAs on carotenoid accumulation, the amounts of carotenoid 
pigments in liquid cultures were examined with the results given in Figure 1. The 

5 marigold DXP synthase, an Arabidopsis IPP isomerase and the Adonis LYTB 
cDNAs gave rise to significant and substantial increases in the amount of 
carotenoids accumulated per volume of culture. A partially-sequenced Arabidopsis 
cDNA (GenBank AA605545) with sequence similarity to the Adonis lytB cDNA gave 
results similar to that of the Adonis cDNA (not shown). A synechocystis PCC6803 

10 lytB gene was less effective but gave a significant enhancement as well. See Table 
1. GAP dehydrogenase was slightly detrimental to pigment accumlation in liquid 
culture, in contrast to earlier observed enhancement for cultures grown on solid 
media. The influence of this gene on pigment accumulation may depend on the 
specific growth regimen. A similar result, enhancement for cultures on solid media 

15 but reduction in liquid culture, was also obtained for an E. coli genomic fragment 
containg DXP synthase (see Example 4 below and Figure 8). A combination of 
DXP synthase with IPP isomerase was significantly more effective at enhancing 
pigment accumulation than was either of the individual cDNAs. 

EXAMPLE 4 

20 Enhancement and Reduction of Ly co pene Accumulation in E. coli containing 
multiple cop ies of genes encoding DXP s ynthase and Farnesyl Pyrophosphate 
synthase, resp ectively 

A number of the inserts in plasmids obtained from dark pink colonies 
selected in screens of the £. coli genomic library (see example 1) contained the 

25 gene encoding DXP synthase in this organism (see Sprenger et al. f 1997; Lois et 
al M 1998). Deletion mapping of one such E. coli genomic fragment indicated that 
the dark pink colony phenotype and enhanced carotenoid accumulation in liquid 
culture was conferred by a fragment containing the DXP synthase and no other 
complete open reading (Figure 8). A genomic fragment containing the ispA and 

30 xseB genes immediately upstream of dxps was, in contrast, quite inhibitory for 
carotenoid accumulation (Figure 8). This diminution in pigmentation results, we 
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surmise, from the activity of the farnesyl pyrophosphate (FPP) synthase encoded 
by ispA (Fujisaki et al. t 1990). The FPP synthase would be expected to compete 
with geranylgeranyl pyrophosphate (GGPP) synthase (Figure 1) for IPP and 
DMAPP and thereby deplete the substrate available for carotenoid biosynthesis. 

Table 1 illustrates the influence of cDNAs encoding DXP synthase, GAP 
dehydrogenase, LYTB, and IPP isomerase, individually and in combination, on the 
accumulation of lycopene, p-carotene, or zeaxanthin in three^strains of E. coli 



engineered to produce these isoprenoid compounds. 



Table 1. Enhancement of carotenoid accumulation in lycopene, b- 
carotene, and zeaxanthin-accumuiating strains of E. coli by introduction of 
plant cDNAs and cyanobacterial genes individually and in combination. 


Plasmid 


cDNA or 
Gene 
Product 


Lycopene 
% of control 


b-Carotene 
% of control 


Zeaxanthin 
% of control 


pBluescript SK- 


Control (empty 
vector) 


(100) 


(100) 


(100) 


pAtipiTrc 


Arabidopsis 
thaliana IPI 


202 ± 12 


227 ±3 


223 ±4 


pTedxps 


Tagetes erecta 
DXPS 


203+ 11 


193 ± 11 


228 ± 40 


pAdlytB 


Adonis 

palaestina LYTB 


157 ±9 


180 ±3 


168 ±2 


p6803lytB 


Synechocystis 
PCC 6803 LYTB 


116 ±11 


111 ± 1 


109 ±2 


pAtgapA 


Arabidopsis 
thaliana GAPD 


79 + 6 


nd 


79 ±3 


Data were obtained with E. coli strain TOP10 containing pAC-LYC, pAC-BETA or 
pAC-ZEAX (Cunningham et al., 1994 and 1996, Sun et al., 1996). Carotenoid 
content (per volume of culture) was measured after 48-72 h growth in LB medium. 
Cultures were inoculated with freshly grown individual colonies from agar plates. 
Values are mean ± sd for 4 to 13 individual cultures. The empty plasmid pBluescript 
SK* (from Stratagene) served as the control and was used as the vector for all of the 
plant cDNAs and the Synechocystis lytB gene. The influence of GAPD on 
accumulation of b-carotene was not determined (nd). 
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We claim: 

1. A method of enhancing the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 
to an untransformed host cell, the method comprising 

5 inserting into the host cell a vector comprising a heterologous nucleic 

acid sequence which encodes for a protein having deoxyxylulose-5-phosphate 
(DXP) synthase enzyme activity, wherein the heterologous nucleic acid sequence 
is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby 
10 producing the protein. 

2. The method of claim 1 , wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

3. The method of claim 1, wherein the isoprenoid comprises at least one 
15 member selected from the group consisting of an essential oil, a fragrance, a 

rubber, a carotenoid and a therapeutic compound. 

4. The method of claim 1 , wherein the isoprenoid comprises a carotenoid. 

5. The method of claim 1, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for dxps. 

20 6. The method of claim 5, wherein the nucleic acid sequence for dxps encodes 
an amino acid sequence which is completely identical to SEQ ID NO: 2. 

7. The method of claim 1 t wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

8. The method of claim 1 , wherein the heterologous nucleic acid sequence 
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originates from a eukaryotic cell. 

9. The method of claim 1, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

10. A method of modifying the production of isopentenyl pyrophosphate (IPP), 
5 dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 

method comprising 

inserting into the host cell a vector comprising a heterologous nucleic 
acid sequence which encodes for a protein which modifies deoxyxylulose-5- 
phosphate (DXP) synthase enzyme activity in the host cell, relative to an 
10 untransformed host cell, wherein the heterologous nucleic acid sequence is 
operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby 
producing the protein. 

1 1 . The method of claim 10, wherein the isoprenoid is derived from at least one 
15 member selected from the group consisting of geranyl pyrophosphate (GPP), 

farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

12. The method of claim 10, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

20 13. The method of claim 10, wherein the isoprenoid comprises a carotenoid. 

14. The method of claim 10, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

15. The method of claim 10, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 
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16. The method of claim 10, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

17. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having deoxyxylulose-5-phosphate (DXP) synthase 

5 enzyme activity, the method comprising inserting into the host cell a vector 
comprising the heterologous nucleic acid sequence, wherein the heterologous 
nucleic acid sequence is operably linked to a promoter, and expressing the 
heterologous nucleic acid sequence. 

18. The method of claim 17, wherein the heterologous nucleic acid sequence 
10 comprises a nucleic acid sequence for dxps. 

19. The method of claim 18, wherein the nucleic acid sequence for dxps 
encodes an amino acid sequence which is completely identical to SEQ ID NO: 2. 

20. The method of claim 17, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

15 21. The method of claim 17, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell 

22. The method of claim 17, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

23. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
20 which encodes for a protein which modifies deoxyxylulose-5-phosphate (DXP) 

synthase enzyme activity in the host cell, relative to an untransformed host cell, the 
method comprising inserting into the host cell a vector comprising the heterologous 
nucleic acid sequence, wherein the heterologous nucleic acid sequence is operably 
linked to a promoter, and expressing the heterologous nucleic acid sequence. 
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24. The method of claim 23, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

25. The method of claim 23, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

26. The method of claim 23, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

27. A method of enhancing the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 
to an untransformed host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having glyceraIdehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity, wherein the heterologous nucleic acid sequence 
is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 
protein. 

28. The method of claim 27, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyi pyrophosphate (GGPP). 

29. Jhe method of claim 27, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

30. The method of claim 27 f wherein the isoprenoid comprises a carotenoid. 

31 . The method of claim 27, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for gapd. 
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32. The method of claim 31, wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 4. 

33. The method of claim 27, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

5 34. The method of claim 27, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

35. The method of claim 27, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

36. A method of modifying the production of isopentenyl pyrophosphate (IPP), 
10 dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 

method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein which modifies glyceraldehyde-3-phosphate 
(GAP) dehydrogenase enzyme activity in the host cell, relative to an untransformed 
15 host cell, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 
protein. 



37, The method of claim 36, wherein the isoprenoid is derived from at least one 
20 member selected from the group consisting of geranyl pyrophosphate (GPP), 

farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

38. The method of claim 36, wherein the isoprenoid comprises at least one 
member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 



25 39. The method of claim 36, wherein the isoprenoid comprises a carotenoid. 
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40. The method of claim 36, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

41. The method of claim 36, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

5 42. The method of claim 36, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

43. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity, the method comprising inserting into the host cell 

10 a vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 

44. The method of claim 43, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for gapd. 

15 45. The method of claim 44, wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 4. 

46. The method of claim 43, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

47. The method of claim 43, wherein the heterologous nucleic acid sequence 
20 originates from a eukaryotic cell. 

48. The method of claim 43, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

49. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
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which encodes for a protein which modifies glyceraldehyde-3-phosphate (GAP) 
dehydrogenase enzyme activity in the host cell, relative to an untransformed host 
cell, the method comprising inserting into the host cell a vector comprising the 
heterologous nucleic acid sequence, wherein the heterologous nucleic acid 
5 sequence is operably linked to a promoter, and expressing the heterologous nucleic 
acid sequence. 

50. The method of claim 49, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

51. The method of claim 49, wherein the heterologous nucleic acid sequence 
10 originates from a eukaryotic cell. 

52. The method of claim 49, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

53. A method of enhancing the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, relative 

15 to an untransformed host cell, the method comprising 

inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having LYTB activity, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 
20 protein.* 

54. The method of claim 53, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

55. The method of claim 53, wherein the isoprenoid comprises at least one 
25 member selected from the group consisting of an essential oil, a fragrance, a 

rubber, a carotenoid and a therapeutic compound. 
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56. The method of claim 53, wherein the isoprenoid comprises a carotenoid. 

57. The method of claim 53, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for lytB. 

58. The method of claim 57, wherein the nucleic acid sequence encodes an 
5 amino acid sequence which is completely identical to SEQ ID NO: 6. 

59. The method of claim 53, wherein the host cell is selected from the group 
consisting of a bacterial cell, an algal cell and a plant cell. 

60. The method of claim 53, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

10 61, The method of claim 53, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

62. A method of modifying the production of isopentenyl pyrophosphate (IPP), 
dimethylallyl pyrophosphate (DMAPP) and/or an isoprenoid in a host cell, the 
method comprising 

15 inserting into the host cell a vector comprising a heterologous nucleic acid 

sequence which encodes for a protein which modifies LYTB activity in the host cell, 
relative to an untransformed host cell, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and 

expressing the heterologous nucleic acid sequence, thereby producing the 

20 protein. 

63. The method of claim 62, wherein the isoprenoid is derived from at least one 
member selected from the group consisting of geranyl pyrophosphate (GPP), 
farnesyl pyrophosphate (FPP) and geranylgeranyl pyrophosphate (GGPP). 

64. The method of claim 62, wherein the isoprenoid comprises at least one 
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member selected from the group consisting of an essential oil, a fragrance, a 
rubber, a carotenoid and a therapeutic compound. 

65. The method of claim 62, wherein the isoprenoid comprises a carotenoid. 

66. The method of claim 62, wherein the host cell is selected from the group 
5 consisting of a bacterial cell, an algal cell and a plant cell. 

67. The method of claim 62, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

68. The method of claim 62, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

10 69. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein having LYTB activity, the method comprising inserting 
into the host cell a vector comprising the heterologous nucleic acid sequence, 
wherein the heterologous nucleic acid sequence is operably linked to a promoter, 
and expressing the heterologous nucleic acid sequence. 

15 70. The method of claim 69, wherein the heterologous nucleic acid sequence 
comprises a nucleic acid sequence for lytB. 

71. The method of claim 69, wherein the nucleic acid sequence encodes an 
amino acid sequence which is completely identical to SEQ ID NO: 6. 

72. The method of claim 69, wherein the host cell is selected from the group 
20 consisting of a bacterial cell, an algal cell and a plant cell. 

73. The method of claim 69, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 
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74. The method of claim 69, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 

75. A method of expressing, in a host cell, a heterologous nucleic acid sequence 
which encodes for a protein which modifies LYTB activity in the host cell, relative 

5 to an untransformed host cell, the method comprising inserting into the host cell a 
vector comprising the heterologous nucleic acid sequence, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter, and 
expressing the heterologous nucleic acid sequence. 

76. The method of claim 75, wherein the host cell is selected from the group 
10 consisting of a bacterial cell, an algal cell and a plant cell. 

77. The method of claim 75, wherein the heterologous nucleic acid sequence 
originates from a eukaryotic cell. 

78. The method of claim 75, wherein the heterologous nucleic acid sequence 
originates from a procaryotic cell. 
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Figure 5A: SEQ ID NO: 1 

Tagetes erects (marigold) cDNA for DXP synthase 
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FIGURE 5A (cont.) 



1851 


aactatagtt 


1901 


create tctot 


1951 


GLO w Lwdu L. \_ a 


2001 


acraaaoctct 

J3 **** 3 w www 




taaatgga t t 


2101 


gatcgatata 


2151 


tcttagctca 


2201 


gcaaagagac 


2251 


agatactata 


2301 


tatatatata 


2351 


tattgtactt 


2401 


aatctcgtcc 



6/13 

caaagttgtt tagcagcaag 
gaccgtggcg gatgeacgat 
aacaactagc aaacgagcat 
ataggaggat ttagctctca 
acttgatgga caccttaagt 
ttgaacatgg ggctcaatct 
aagcatattg cagcaacagc 
actacatgea etcaaegtat 
aaagacaatg ctttgctgga 
tatatatata tatatatatt 
ttcttgtaac atacatggtc 



tgaagttctt aaaaaaattg 
tctgcaagcc gcttgatgga 
gaggtcctca tcacagttga 
cgtctctcat ttcttggcct 
ggagagcaat gatgttgcct 
gaccagattg ,aggaggctgg 
tttatcattg attgggggca 
aaattgaaat atgtcatgct 
ctcttttgat catatatata 
gtagttttcg atcatataca 
gcgttgataa ttgttgacat 
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Figure 5B: SEQ ID NO: 2 -jj^ 

PREDICTED AMINO ACID SEQUENCE of Tagetes erecta (marigold) cDNA for 
DXP synthase 

TRANSLATION from: 55 to: 2232 Length: 726 



1 


MALCGALKGG 


FVPIAQNGYT 


SSSLLNPSAN 


AIMPSNKRKF 


LGIVAVSKEH 


51 


ATNEHEDLTT 


MDKTTSTTLK 


YSGDKPKTPI 


LDTINYPIHM 


KNLCVEELVK 


101 


LADELREE IV 


YTVSKTGGHL 


SSSLGWELT 


VSLHHVFNTP 


EDKIIWDVGH 


151 


OAYPHKILTG 


RRSRMRTIRO 


TFGLAGFPKR 


DESNHDAFGA 


GHSSTSISAG 


201 


LGMAVGRDLL 


GKNNHVTAVI 


GDGAMTAGQA 


YEAMNNAGYL 


DSNLIIVLND 


251 


NRQFSLPTAT 


IDGPAPPVGA 


LSRSLTRLQT 


SQKFRQLREA AKEVTKQLGD 


301 


KTHEVAAKMD 


SLVKGMVGGQ 


GASMFEELGL 


YYVGPVDGHN 


LEDLVYVFDK 


351 


IKSMTAPGPV 


LVHIVTEKGK 


GYPPAEVAAD 


KMHGWKFDT 


QTGKQKKNKT 


401 


KTLSYTQYFV 


DSLVAEAKZD 


DKIVAIHAAM 


GGGTGLNTFQ 


KEFPARCFDV 


451 


GIAEQKAITF 


AAGLATEGLK 


PFCAIYSSFL 


QRGYDQWHD 


VDLQKLPVRF 


501 


AMDRAGLVGA 


DGPTHCGAFD 


TTFMACLPNM 


WMAPSCEAE 


LMNMVATAVA 


551 


IDDRPSCFRY 


PRGUGIGSIL 


PANNKGTLIE 


VGTGRVIKEG 


NRVALLGYGT 


601 


IVQSCLAASE 


VLKKIGISVT 


VADARFCKPL 


DGNLIKQLAN 


EHEVLITVEE 


651 


GSIGGFSSHV 


SHFIiALNGLL 


DGHLKWRAMM 


LPDRYIZHGA 


QSDQIEEAGL 


701 


. SSKHIAATVL 


SLIGGSKETL 


HALNV* 
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Figure 6A: SEQ ID NO; 3 3 

Arabidopsis thaliana cDNA for a chloroplasts isofomn of glyceraldehyde-3- 
phosphate (GAP) dehydrogenase. 



1 


ctgtccccaa 


gggtttcact gaattctcag 


gattgegaag 


ctcctctgct 


51 


tctcttccct 


teggcaagaa 


actttcttcc 


gatgagttcg tttccatcgt 


101 


ctccttccag 


acttctgeaa tgggaagcag 


tggtggatac 


aggaaaggtg 


151 


tgactgaggc 


caagcttaag gtggccatta 


atggattcgg 


taggateggg 


201 


aggaacttcc 


tgagatgttg geatggtege 


aaggactctc 


ctcttgatat 


251 


cattgccatt 


aatgacactg gtggcgtcaa 


gcaggcttcg 


cacttactta 


301 


aatacgactc 


tactctegga 


atctttgatg 


ctgatgtcaa 


accttctgga 


351 


gagactgcaa 


tctctgttga 


tggaaagatc 


atccaagttg 


tctctaaccg 


401 


aaacccgtct 


cttctccctt 


ggaaggagct 


aggaattgac 


attgtcatcg 


451 


aaggaaccgg 


agtgtttgcg 


gatagagaag. 


gtgcagggaa 


acacattgaa 


501 


gctggtgcca 


agaaegttat 


cattactget 


ccaggcaaag 


gagatattcc 


551 


aacttatgtc 


geeggrgtea 


atgcagatgc 


ttacagtcat 


gatgaaccta 


601 


tcatcagcaa 


tgcatccrgc 


actaccaact 


gtcttgcccc 


ctttgtcaaa 


651 


greet tgacc 


agaaaccegg 


tatcataaag 


ggtacaatga 


cgaccactca 


701 


ctcttacacc 


ggzgaccaga 


ggtrgctaga 


cgcgagccac 


cgtgatctaa 


751 


ggagagcaag 


agcacctgct 


ttgaacattg 


ttcctacntc 


tacaggagca 


801 


gctaaagctg 


egge ~ c t 


gctccctaac 


ctcaaaggaa 


aacccaacgg 


851 


gatcgctctc 


cgcg-accaa 


caccaaacgt 


atcagtggtt 


gatcccgttg 


901 


tgeaggtetc 


aaagaagaca 


tttgetgagg 


aagtcaaege 


tgec-tcaga 


951 


gattctgeag 


agaaagagct 


taaaggtata 


ctcgatgtct 


gegatgagee 


1001 


actagtgtcc 


gtcgarttca 


gatgetcaga 


tttttcaacg 


accattgatt 


1051 


catcactcac 


tatggttatg 


ggagatgata 


tggttaaggt 


gattgettgg 


1101 


tatgataatg 


aatggggtta 


ctcacagaga 


gttgttgact 


tggctgacat 


1151 


tgttgccaac 


aactggaagt 


gatttcatca 


aactttgttt 


catgtttttc 


1201 


catttttttc 


tctttctctt 


ttttacatta 


tggttctcaa 


aatatcggtg 


1251 


aagactgtat 


a 
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Figure 6B: SEQ ID NO: 4 

Predicted amino acid sequence of Arabidopsis thaliana cDNA for a chloroplasts 
isoform of glyceraldehyde-3-phosphate (GAP) dehydrogenase. 

1 VPKGFTEFSG LRSSSASLPF GKKLSSDEFV SIVSFQTSAM GSSGGYRKGV 

51 TEAKLKVAIN GFGRIGRNFL RCWHGRKDSP LDIIAINDTG GVKQASHLLK 

101 YDSTLGIFDA DVKPSGETAI SVDGKIIQW SNRNPSLLPW ^KELGIDXVIE 

151 GTGVFVDREG AGKHIEAGAX KVTITAPGKG DIPTYWGVN ADAYSHDEPI 

201 ISNASCTTNC LAPFVKVLDQ KFGIIKGTMT TTHSYTGDQR LUDASHRDLR 

251 RARAAALNIV PTSTGAAKAV ALVLPNLKGK LNGIALRVPT PNVSWDLW 

301 QVS KKTFAEE VNAAFRDSAE KELKGILDVC DEPLVSVDFR CSDFSTTIDS 

351 SLTMVMGDDM VKVIAWYDNE WGYSQRWDL ADIVANNWK* FHQTLFHVFP 

401 FFSLSLFYIM VLKISVKTV 
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Figure 7A: SEQ ID NO: 5 
Adonis pataestina cDNA 
Length: 1619 June 4, 1998 



1 


ccttcgtctc 


tgcacttctt 


tcttccatgg 


cgacttctct 


teaattctge 


51 


cgtttctcca 


ccccatctga 


tctctccttc 


cctgaaaccc 


gctcttccac 


101 


tcgcctctac 


cgatccaaga 


aaccattttc 


cgtccgttgt 


cacagtgaag 


151 


gaccttcagg 


ctcctcctcc 


acggcagtgg 


aatccgaatt 


egatgecaag 


201 


tctttccgcc 


acaaccttac 


aagaagcaag 


aactacaacc 


gaagaggttt 


251 


cggccacaaa 


gatgagactc 


ttgagcttat 


gaacagegag 


tatacaagtg 


301 


atgttattaa 


gaagttgaag 


gagaatggaa 


acgagtacag 


ttggggacct 


351 


gttacagtga 


aattggctga 


gtcgtatggg 


ttctgttggg 


gtgtggaaag 


401 


agctgttcag 


actgcttatg 


aagctagaaa 


acagtttccc 


gatgagaaga 


451 


tttggattac 


taatgagatt 


atccataatc 


caactgttaa 


caagagatta 


501 


gaagagatgg 


aagttaaaga 


gattccagtt 


ggggatgega 


agaaacattt 


551 


tgatgttgtt 


gcaaagggtg 


acgttgtgat 


tttgeegget 


tttggagctg 


601 


ccgtaagtga 


aacgc react 


ttgagcgaga 


aaaatgtcca 


gatagtcgac 


651 


acaacttgcc 


catgggcatc 


taaggtgtgg 


aattccgttg 


agaagcataa 


701 


gaagggagag 


tacacatcaa 


ttattcatgg 


taaatattcc 


cacgaggaaa 


751 


ctattgcaac 


tgcatctctt 


get gaga act 


atatcattgt 


gaagaacatg 


801 


gatgaggcaa 


tgtatgtgcg 


cgacracact 


cttggaggtg 


agcttaatgg 


851 


atccagctca 


gacaaacagg 


cacttttgga 


gaaattcaaa 


tacgeaatte 


901 


ccgaaggatt 


tgatccagac 


acagatctta 


ttaaaacagg 


cattgeaaac 


951 


caaactacaa 


tgttgaaggg 


agaaacagaa 


gatattggta* 


aacttctcga 


1001 


aaagactatg 


atgcgcaagt 


atggagttga 


gaatattaat 


gaccacttca 


1051 


taagctttaa 


cacaatatgt 


gatgetacac 


aagagagaca 


agacgecatg 


1101 


tttaagttgg 


tagaggagaa 


agtggatctc 


atattagtag 


ttggtggatg 


*1151 


gaattcaagt 


aacacctcac 


atctacaaga 


aatttctgag 


ttacgtggca 


1201 


ttccttctta 


ctggattgac 


agtgagacga 


gaataggacc 


aggaaacaag 


1251 


attagccaca 


aattgaatca 


tggtgagcta 


gttgagaccg 


aaaactggct 


1301 


cccagagggt 


cctgttacaa 


ttggtgtgac 


ateaggtget 


tctaccccag 


1351 


ataaggctgt 


tgaagatgct 


cttgtaaagg 


tgtpcgacat 


caagegegaa 


1401 


caattgttgc 


agctagcata 


gagaccagtg 


ttctgeagag 


atcaaagaac 


1451 


caccactgtt 


caatatgtgt 


gattgtatat 


gtatcatgga 


gaacacaggg 


1501 


atatagaaac 


tgtaatacct 


tgttatgtta 


tactatatgg 


atgtccttta 


1551 


ccataatgat 


gtaaattttt 


agttccatgc 


aacaatattt 


tcttgaatga 


1601 


aaaaaaaaaa 


aaaaaaaaa 
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Figure 7B: SEQ ID NO: 6 11/13 

Predicted amino acid sequence of Adonis palaestina LYTB 

Length: 465 Translate 27-1421 

1 MATSLQFCRF STPSDLSFPE TRSSTRLYRS KKPFSVRCHS EGPSGSSSTA 

51 VESBFDAKSF RHNLTRSKNY NRRGFGHKDE TLELMNSEYT SDVIKKLKEN 

101 GNEYSWGPVT VKLAESYGFC WGVERAVQIA YEARKQFPDE KIWITNEIIH 

151 NPTVNKRLEE MEVKEIPVGD GKKHFDWAK GDWILPAFG AAVSEMLTLS 

201 EKNVQIVDTT CPWVSKVWNS VEKHKKGEYT SIIHGKYSHE ETIATAS FAG 

251 KYIIVKNMDE AMYVCDYILG GELNGSSSDK QALLEKFKYA ISEGFDPDTD 

301 LIKTGIANQT TMLKGETEDI GKLLEKTMMR KYGVENINDH FISFNTICDA 

* 351 TQERQDAMFK LVEEKVDLIL WGGWNSSNT SHLQEISELR GIPSYWIDSE 

401 TRIGPGNKIS HKLNHGELVE TENWLPEGPV TIGVTSGAST PDKAVEDALV 

451 KVFDIKREQL LQLA* 
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FIGURE 8 



* » * * 50 

Ap :MATSLQFCRFST?SDLS??tTRSSrRLYRSKK?FSVRCKSSGPSGSSSTA 

* * * * 100 

Ap : VE SE FDAKS'FRHNl TRSKitifNSRGFGHKDE TLE LMNSE3TT:S DV'J. KKLKEN 
Sy : MJD TKAFKR SXH KS DNjy KRKG^G'H GE E VM G VTrfNTX2f Q r S HLIQS I R QN 



Ap : GNEYSWGPVTVK] 
Sy : N Y R L E R.G D V^T.xj 
Ec : -MQX " 




gW^EngVQ^A?EjA;RKQF.I»JD E KXWXT 
» wf3B E ffi V AM jraETiR O HiFl G D RLWTT Npj T-Tl 
"'AfSiD™ I SIVEN'AL - A*I YjGA ?IYVRH@V 





jMEVKI 
RgMLE^VNl 
"1RGAI.1 




jPV G DG KXKFD 

Idvvngzxdp.s 



* 200 
Fg A AV. SEMLT1S 

F saSs^e e-m QLXN 

Hgv|QAVRNEAK 



Ap : EKNVQT 
Sy : DiRE C2XvB 
Ec : sSdLTVfE 





* 




f 


ICS 




T 


TCP 




A 


TCP 





JWK^gE.K'EKKGE YTj 

y| w5Ts E E."k;-:icic kx h5| 

;Vi H M A R A £ R R G E E J 



j I H gkXNj: 
JLIgHAGy, 



250 

EO XXaR S STAG 
?P VE gRm- - — 



Ad : K22E3r^K2TODEAMYV;C!52rX] 
Sy : T jgjL Jr V L A S3W Q KTCI3 YZX 
Ec : ~ : 



* * 300 

■ :NG3SSD:K2 AILTKI.X YA. I S E G7D-ED TD 

X Kg GU R:Q E FX D YEA HA. H:S A GFX)iE"D. QD 

- - £CY SN?EGG-MYXVX:S PDDVWKLrVKNEE- 



ad :Eikt:g:j 



3mxicge^e3x'g^xeE3:mmrx^E v 2 n IKDS* iS-TNi 



350 



Sy : XV RXG V ANMiiMXK S EREMJG KX FEg T. L L Q K Y g P I E LJK N^FM3:FNJT 




Ap :R^I^dBm;fk: 

Sy : jffi DgMT D 
Ec :HtnH3eHvR 




EKVDLIXKVgGW 
ZDLSXMvRfgGF 
- QAEVVxBvisSK 




* * 400 

T S HgQgX SEL Rg'XgS Y WggTSE 

snrmap La : o rmBk r a f lmBd a 



* * * 

Ap : TgFjG^GiraKI STTKLNKGEXV 

Sy : dSR GpP .G0 R V E .H K ? L G K D L E V Lg Py^x « * -± 7 y. 
Ec :^D0Q-- HsHv-KEVKCV 



"XPA.GKIX3T 



GVT j GAS jj PD 
GVT = GAS jj PD 
GVT SgAS SpD 



* 450 

Tg3k aQe dalv 

TBgKvfflE-EVMK 

a53ilBqnvva 



Ap :gVFD;X-KRE;Q - LLQLA 

Sy :KILA-XjKEAQPVLE TAG 

Ec : RLQQLGGGEAI RLE G RE E N I V F E V ? KE LRVD I RE VD 



464 
406 
316 
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FIGURE 9 
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